MEMORY EGRESS SELF SELECTION ARCHITECTURE 
CROSS REFERENCE TO RELATED APPLICATIONS 

[0001] This application claims the benefit of priority of U.S. Provisional Application 

No. 60/458,375 filed March 31, 2003, which is incorporated herein by reference. 

FIELD OF THE INVENTION 

[0002] The present invention relates generally to memory management in time- 

division multiplexing (TDM) switches. More particularly, the present invention relates to the 
storage of incoming grains in a TDM switch prior to retransmission. 

BACKGROUND OF THE INVENTION 

[0003] In telecommunication switching, time-division multiplex switching is commonly 

implemented. In such a system, a plurality of ingress ports provide a switch with grains 
containing data, and the received grains are multiplexed to one or more egress ports. The 
multiplexing of grains from ingress to egress ports is commonly done using a fixed mapping. 
Such time division multiplexed switches are commonly used to cross*connect lower rate 
signals (grains) that are contained within a higher rate signal, also referred to as a grain 
group. A grain group is composed of a fixed number of grains. In a standard synchronous 
optical network (SONET) based STS-48 channel carrying a VT1.5 virtual tributary, for 
example, each VT1.5 is a grain and the 1344 VT1.5s in the channel form a grain group. 
[0004] Figure 1 illustrates a plurality of data channels, received by ports A through N. 

Each port receives grain groups 100 containing G grains, one such grain being labelled as 
102. Each grain 102 may be associated with an independent client and has a fixed location 
within the grain group 100. The entire grain group 100 is presented once every G clock 
cycles. Clock cycles are often referred to as time-slots, the use of either term should be 
considered interchangeable. An individual datum from a given client Is available at each 
occurrence of the associated grain 102. The grain groups 100 in all the ports are shown to be 
aligned. This is a common requirement in TDM switches. In SONET switches, for example, 
an external alignment signal, which provides timing to the SONET transport frame or the 
tributary multi-frame, is used to align the grain groups. 

[0005] Figure 2 illustrates the general architecture of a TDM switch. In general, the 

switch has a number of inputs and number of outputs. Data grains 102received by ingress 
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ports (not shown) are ordered as grains groups 100. The grains 102 are multiplexed by the 
switch 104 and transmitted by the egress ports (not shown) ordered as egress grain groups 
106. For illustrative purposes, the switching system illustrated in Figure 2 has N Inputs and N 
outputs (NxN), but one skilled in the art will appreciate that a similar architecture can be 
implemented for asymmetrical switches. In Figure 2, grains 102 are labelled with a letter 
representing the input port and a number (ranging from 1 to G, the number of grains in the 
grain group 100) representing the byte position at the port. As can be seen in the output, 
bytes may be reordered to any position, may be multicast to several ports (or a single port), 
or may be dropped. Figure 2 shows switching of grains within one grain group. The same 
switching of grains is performed at all recurrences of the grain groups. 
[0006] Figure 3 illustrates the conceptual implementation of an N x N memory switch 

104. Two memories. 108a and 108b, are typically employed, each memory having N write 
ports 110 and N read ports 112. Two memories are used so that ingress ports can write a 
grain group to one memory using write ports 110 while the egress ports read a grain group 
from the other memory using read ports 112. The specific memory written to or read from 
alternates every grain group. The ingress ports write data into memory at an address that is 
indexed by the ingress port and timeslot numbers of the ingress grain. Egress ports read 
data from the address required to select the desired ingress grain thereby achieving the 
switching of ingress data to egress ports. The read address of the egress ports is defined by 
a connection memory that is configured by the user. 

[0007] Figure 4 illustrates the above described method of multiplexing grains from the 

ingress ports to egress ports. The switch receives a grain as part of a grain group at each of 
the ingress ports in step 114. These grains are buffered in a memory in step 116. In many 
implementations, the memory is sized to store one grain group from each of the ingress 
ports. The process of receiving and buffering grains Is repeated until it has been determined 
that the last grain in a grain group has been received as shown in step 118. When the last 
grain in a grain group has been received, as determined in step 118i the process effectively 
forl<s. In the first fork, the memory bank used to buffer incoming grains in switched in step 
120. In the second path of the fork, the memory bank that was just filled with incoming grains 
is used to multiplex out the received grains in step 122. This reading of the grains preferably 
takes no more than the amount of time required to filla memory bank. The completely read 
out memory bank is then switched for a full memory bank in step 124. If the filling of a 
memory bank and the reading out a memory bank are timed properly, the process can repeat 
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without a delay. The TDM switch will alternate between memory banks, so that as one 
memory bank is filled, via steps 114 and 116, the other Is emptied via step 122. The memory 
banks are then switched in steps 120 and 124, preferably simultaneously. 
[0008] It is common for switches to have a plurality of ingress and egress ports, 

though, using current technology, it is not practical to build fast memories with more than 3 or 
4 ports. Generally, building a switch with multiple ingress and egress ports requires a slightly 
different architecture than the simplified illustration of Figure 3. Two of the most common 
implementations are the flip-flop-based implementation illustrated in Figure 5 and the RAM- 
based implementation illustrated in Figure 6. Both these implementations make use of dual 
memory banks, and have an operating method similar to that illustrated in Figure 4. 
[0009] The flip-flop based implementation of an N x N memory switch, as illustrated 

in Figure 5, works in two steps. First, the switch buffers all the incoming grains, received on 
ingress ports 110, in flip-flops, one of which is labelled 126, After receiving a full grain group, 
the grains are multiplexed out onto the egress ports 112. The multiplexing is performed 
according to the connection memory configuration 130 and requires a wide NG:1 multiplexer 
128 per memory bank. Double buffering is employed to store one set of ingress grains while 
the other set is being multiplexed out. This implementation requires G flif>flop storage 
elements 126 and a wide NG:1 multiplexer 128 for each egress port 112. The order of the 
area cost of a flip-flop based N x N memory switch is governed by the following equations: 
Area Cost nxn = Area ff + Area mux (1.1) 
Area CostNxN = Order ( NG ) + Order ( loga { NG ) ) (1 .2) 

Where N = number of ports 

G = number of grains in a grain group 
[0010] The random access memory (RAM) based implementation of an N x N 
memory switch works with the same two operational steps. First, the switch buffers all the 
incoming grains, received on the ingress ports 110, in RAMs, one of which is labelled 132. It 
then multiplexes the grains out onto the egress ports 112 according to the connection 
memory configuration 130. Double buffering is employed to store one set of ingress grains 
while the other set is being multiplexed out. The main difference from the flip-flop based 
implementation is all the ingress grains are stored in RAMs 132 instead of flip-flops. A RAM 
is more area efficient, but only one grain can be accessed at a time. Consequently, each 
ingress grain must be stored on a per egress port basis rather then on a per switch basis. 
Thus, the RAM based implementation minimizes the width of the egress multiplexer 134 to 
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N:1 per egress port but requires NG storage elements 132 per egress port. The order of the 
area cost of a memory based N x N memory switch is governed by the following equations: 
Area Cost n x n = Area ram + Area mux (2.1) 
Area Cost n x n = Order ( N^G ) + Order ( logaC N ) ) (2.2) 
Where N = number of ports 

G = number of grains in a grain group 
[0011] Telecommunication systems typically have a maximum physical area and cost 

budget for printed circuit boards. Minimizing the physical area and cost of a printed circuit 
boards allows network equipment vendors to minimize equipment cost in order to gain 
market share. Since both physical area and cost are derived from both the number and 
physical dimension of integrated components, reducing the number and size of those 
components will have a positive effect on both the physical area and cost of the printed 
circuit board. In particular, memory switch area grows with switch capacity, thus it Is 
important to reduce area in switching components to enable the design of large switching 
systems with a minimal number of components. 

[0012] Both the implementation of Figure 5 and the implementation of Figure 6 

consist of a memory to store ingress grains for subsequent output to the egress ports and 
multiplexing logic to select the desired ingress grain. The area costs for these 
implementations are defined above in equations 1.1, 1.2, 2.1 and 2.2. 
[0013] In the flip-flop based implementation of Figure 5, where a wide NG:1 

multiplexer per egress port is required, the multiplexer is able to select NG ingress grains but 
only G ingress grains are selected per egress port. This results in a sub-optimal utilizationof 
the dedicated resources. The following equations give the utilization percentage of the 
egress multiplexer in a flip-flop based implementation of an N x N memory switch as 
illustrated in Figure 5. 

Utilization % n x n = [Tot egress mux - Inactive egress mux] / [Tot egress mux] (3.1 ) 
Utilization % n x n = [N^G loga (NG) - N(N-1 )G logs ((N-1 )G)] / [N^G log> (NG)] (3.2) 
Where N = number of ports 

G = number of grains in a grain group 
For N = 18 and G = 1344 Utilization % nxn = 6.1 % 

(i.e. 45G SONET/SDH VT/TU cross connect) 

[0014] In the RAM based Implementation of Figure 6 which requires N RAMs per 

egress port, the RAMs hold NG ingress grains but only G ingress grains are read out per 
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egress port. The area is not optimal with respect to the storage element. The following 
equations gives the utilization percentage of the storage element in a RAM based 
implementation of an N x N memory switch. 

Utilization % N X N " [Tot storage — Unused storage] / [Tot storage] (4.1 ) 

Utilization % n x n = [N^G - N(N-1 )G1 / [N^G] (4.2) 
Utilization%NxN = 1 / N (4,3) 
Where N = number of ports 

G == number of grains in a grain group 
For N = 1 8 and G = 1 344 Utilization % n x n = 5.5 % 

(i.e. 45G SONET/SDH VTn"U cross connect) 

[0015] Though the RAM and flip-flop architectures allow for multiplexing the grains 

received by the ingress ports to the egress ports in a time efficient manner, their space 
consumption requires large implementations in chips and on printed circuit boards. These 
two factors increase their cost. The large implementation area is related to the poor utilization 
of multiplexing capacity and RAM storage. It is, therefore, desirable to provide a multiplexing 
egress structure that reduces the implementation size and increases the utilization 
percentage. 

SUMMARY OF THE INVENTION 

[0016] It is an object of the present invention to obviate or mitigate at least one 

disadvantage of previous time division multiplexing switches. 

[0017] In a first aspect of the present invention, there is provided a time division 

multiplexing switch. The TDM switch comprises a plurality of ingress ports and at least one 
memory egress self selection (MESS) egress port. Each of the ingress ports in the plurality 
synchronously receives data grains at fixed time intervals. The received data grains are 
ordered as grain groups. The at least one MESS egress port receives the data grains from 
the plurality of ingress ports, preferably as an aggregate, and transmits stored data grains in 
a predetermined order. The at least one MESS egress port has a data grain selector for 
selecting data grains from the received data grains in accordance with at least one 
predetermined criterion prior to storing the selected grains for transmission by the MESS 
egress port. 

[0018] In an embodiment of the first aspect of the present invention the plurality of 

ingress ports and the at least one memory egress self selection egress ports are equal in 

-5- 



number, so that the switch is a symmetrical switch. In another embodiment of the present 
invention, the switch includes a grain aggregator, operatively connected to the plurality of 
ingress ports, for aggregating the data grains received by each of the ingress ports and for 
providing the aggregate to the at least one MESS egress port. 

[0019] In a further embodiment of the first aspect of the present Invention, the data 

grain selector Includes an ingress processor for both receiving the aggregate and for 
selecting data grains from the aggregate for storage In accordance with the at least one 
predetermined criterion. In a presently prefen-ed embodiment, the ingress processor 
includes both an interest memory for storing a grain mask conresponding to the 
predetermined grain selection criterion and a finite state machine for selecting grains from 
the aggregate for storage in accordance with the grain mask. In afurther embodiment, the 
grain mask is based on the associated ingress port and the position of the grain the 
respective grain group. 

[0020] In another embodiment, each of the MESS egress ports includes a memory 
and an egress processor. The memory stores the data grains selected by the ingress 
processor. The egress processor reads and transmits the stored data grains from the 
memory in a predetermined order. In other embodiments the ingress processor includes a 
memory compactor for addressing the selected data grains for storage in the memory without 
memory fragmentation, the memory compactor preferably including, or using an already 
existing finite state machine for selecting the first available location in memory as well as a 
plurality of multiplexers for multiplexing the selected data grains into the memory. In other 
embodiments, the memory stores only the selected data grains and is preferably sized to 
store exactly one grain group. In other embodiments the egress processor includes an 
egress processing memory for storing the predetermined order for reading and transmitting 
the stored data grains and optionally the egress processor is connected to a connection 
memory for storing connection information which additionally provides the interest RAM with 
the predetermined criterion and the egress processing memory with the predetermined order 
in accordance with the stored connection information. 

[0021] In a further embodiment, the egress processor includes an N:1 multiplexer 

attached to the memory for reading and sequentially transmitting the stored data grains in the 
predetermined order, where N is the number of ingress ports. In another embodiment, the 
egress processor Includes an N:M mulitplexer attached to the memory for reading and 



sequentially transmitting a plurality of data grains in the predetermined order, where N is the 
number of ingress ports and M > 1 . 

[0022] In another aspect of the present invention, there is provided a method of time 

division multiplex switching received data grains to at least one egress port. The method 
comprises the steps of: receiving and aggregating a plurality of data grains received in a 
single timeslot at a number of ingress ports, each of the plurality of data grains being 
associated with a grain group; transferring the aggregate of the received data grains to the at 
least one egress port; selecting from the aggregate the data grains to be transmitted by the 
at least one egress port; storing the selected data grains at the least one egress port; and 
transmitting the stored data grains from the at least one egress port In a predetenmined 
order. 

[0023] In embodiments of the present invention the step of selecting Includes 

applying a mask to the aggregate to select grains in accordance with the Ingress port 
associated with the position of the grain in the aggregate and the position of the grain in its 
respective grain group and the step of storing the selected grains includes storing only the 
selected grains. In another embodiment, the step of storing the selected grains includes 
compactly storing the selected grains in a memory and the step of transmitting includes 
reading stored grains from the memory in a predetermined order. 

[0024] Other aspects and features of the present invention will become apparent to 

those ordinarily skilled in the art upon review of the following description of specific 
embodiments of the Invention in conjunction with the accompanying figures. 

BRIEF DESCRIPTION OF THE DRAWINGS 

[0025] Embodiments of the present invention will now be described, by way of 

example only, with reference to the attached Figures, wherein: 

Figure 1 is an illustration of a plurality of grain groups transmitted on a plurality 
of channels; 

Figure 2 is a block diagram illustration of a conventional time division 
multiplexing switch architecture; 

Figure 3 is a prior art implementation of a time division multiplexing switch 
architecture; 

Figure 4 is a flow chart Illustrating a prior art implementation of a method to 
perform time division multiplex switching; 
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Figure 5 is a block diagram of a prior art flip-flop based time division 
multiplexing switch; 

Figure 6 is a block diagram of a prior art RAM based time division multiplexing 

switch; 

Figure 7 Is a block diagram of a system architecture of the present invention; 
Figure 8 is a block diagram illustrating the architecture of a system of the 
present invention; 

Figure 9 is a flowchart illustrating a method of time division multiplex switching 
according to the present invention; 

Figure 10 illustrates a memory egress self selection port according to one 
embodiment of the present invention; 

Figure 1 1 illustrates the data RAM of a memory egress self selection port 
during operation according to one embodiment of the present invention; 

Figure 12 illustrates the interaction between the ingress processor and data 
RAM of a memory egress self selection port according to one embodiment of the 
present invention; and 

Figure 13 illustrates the interaction between the egress processor and data 
RAM of a memory egress self selection port according to one embodiment ofthe 
present invention. 

DETAILED DESCRIPTION 

[0026] Generally, the present invention provides a method and system for 

multiplexing data grains received on a number of ingress ports to a number of egress ports. 
For the purposes of illustration, the number of ingress ports and egress ports will be identical, 
though one skilled in the art will readily appreciate that the teachings below can be applied to 
asymmetrical switching systems. 

[0027] The present invention provides a time division multiplex (TDM) switch that 

increases the percentage utilization of components in comparison to many prior art switches. 
The present invention also provides a novel method of multiplexing data grains received on 
ingress ports to egress ports. The present invention provides a system and method of time 
division multiplex switching data grains received on ingress ports to a number of egress 
ports, each egress port storing selected received grains In a memory for transmission by the 
egress port. 
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[0028] Figure 7 illustrates the architecture of a system cf the present invention. 

Ingress ports 1 through N, one of which is labelled as 110, receive data grains ordered as 
grain groups. Preferably the grains are received synchronously across the nunnber of ingress 
ports. From ingress ports 110, the grains are aggregated by aggregator 202 into an N grain 
wide channel. For the purposes of this exemplary illustration, each data grain is assumed to 
be 1 byte in size, resulting in a requirement for an N byte wide channel to con^espond to the 
N ingress ports. The N byte word representing the received grains is provided to the memory 
egress self selection (MESS) egress ports, one of which is labelled 204. Each MESS port 
204 stores only the grains that it will transmit in an internal memory. Because each MESS 
port 204 has its own internal memory, the complex multiplexing circuitry utilized by the flip- 
flop based implementation is eliminated in favor of the simplified multiplexing circuitry of the 
RAM based implementation previously described. However, MESS port 204 is provided with 
processing capacity to determine which data grains are of interest, that is, which received 
data grains will be transmitted over the particular MESS port 204 By discarding the grains of 
no interest, or conversely by selecting the grains of interest, only the selected grains need to 
be stored which reduces the required memory storage for each port. 
[0029] Figure 8 provides a more detailed architecture for a system of the present 

invention. Data grains are received by ingress ports, one of which is labeled as 110. The 
grains received by the ingress ports are provided to ingress processor 206 in MESS egress 
port 204. Ingress processor 206 selects data grains from the ingress ports 110 for storage in 
memory 208. Whereas the RAM based implementation required N RAMs of G bytes, the 
present invention reduces the required RAM, as only the grains to be transmitted by the 
egress port are stored. As a statistical average, this means that each of the N memories can 
be reduced in size to G/N from G. Over N ingress and egress ports, this reduces the memory 
size to G bytes from an original GN bytes. To multiplex out the stored grains while new 
grains are received, a double buffering implementation can be employed. The order of the 
addresses to be read out from the memory is provided by an egress processor 212 which 
accesses connection memory 214. The reading of the stored grains uses a simple N:1 
multiplexer 210 on a per port basis. This combines the reduced memory requirement 
advantage of the standard flip-flop implementation with the simplified multiplexing circuitry 
advantage of the RAM implementation. 

[0030] In operation, as well as in structure, the system illustrated in Figure 8 differs 

from previous implementations of TDM switches. Where known TDM switches store all the 
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ingress grains, the MESS implementation of the present invention preferably stores only the 
selected ingress grains at each egress port. There are still N RAMs 208 but each RAM is 
1/Nth the size of the ones in the traditional RAM based implementation. The ingress 
processor 206 preferably packs the selected grains into the N RAMs 208to avoid holes, also 
referred to as memory fragmentation. The next step is also different from both previous 
implementations which multiplex out the grains on the egress ports according to the 
connection memory configuration. In the MESS implementation of the present invention, the 
egress processor 212 multiplexes out the grains based on where in RAM 208 the ingress 
processor 206 has stored the data. Use of connection memory 214 is presently preferred for 
properly configuring both the ingress and egress processors 206 and 212 respectively to 
enable the mapping between ingress and egress ports. Double buffering is also preferably 
utilized in order to store one set of ingress grains while the other set is being multiplexed out. 
There are two extra blocks in this implementation: the ingress processor 206 and egress 
processor 212. The area of those two blocks is negligible when the grain count is large. As a 
result, the MESS implementation of the present invention reduces both the width of the 
egress multiplexer 210 and the number of storage elements G per egress port. The order of 
the area cost of a MESS based N x N memory switch is governed by the following equations. 
Area Cost n x n = Area ram + Area mux + Area ingress proc + Area Cost egress proc (5 - 1 ) 
Area Cost n x n = Order (NG) + Order (N^log2(N))+Order (N2G+N^log2(N))+ (5.2) 

Order(NGIog2(N)+ NGIog2(G/N)) 
Where N = number of ports 

G = number of grains in a grain group 
[0031] The MESS based implementation of an N x N memory switch illustrated in 

Figure 8 achieves both reductions of the egress multiplexing per egress port to one N:1 
multiplexer and the reduction of the minimum storage elements per egress port to G memory 
units. The utilization percentage of both the egress multiplexer and the storage element is 
100 % in the illustrated implementation. 

[0032] Figure 9 Is a flow chart illustrating a method of the present invention. In this 

embodiment of the method it is assumed that the memory bank is sized the same as the 
grain group, so that each memory has a capacity of one grain group. Thus, when the end of 
a grain group is identified the memory bank is filled. The ingress ports receive data grains in 
step 216. The received grains from each of the ingress ports is provided to the ingress 
processor in step 218. In step 220 grains selected from those received by the ingress 
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processor in step 218 are stored in the memory. In step 222a determination as to whether 
the most recently received grains are the end of the grain group is made. If the end of the 
grain group has not been reached the process returns to step 216 If the end of the grain 
group has been reached the process forks. In the first fork, the memory bank is switched in 
step 224 and the process returns to step 216. In the second fork, a selected memory location 
in the recently filled memory is read out to the egress port in step 226 A determination is 
made as to whether the most recently read out grain is the end of the grain group in step 
228. If the end of the grain group has not been reached the process retums to step 226 and 
grains continue to be read out for each egress port until the end of the grain group has been 
reached, at which point the memory bank is switched in step 230. As the egress port should 
transmit a grain group in the same time as the ingress port receives a grain group, the 
switching of memory banks should happen simultaneously. 

[0033] Figure 10 illustrates a generic MESS Egress port 204. The wide ingress data 

bus feeds ingress processor 206 which selects and extracts grains and stores them in the 
data RAM 208. This allows for the reduction in the memory area allocation. The order of the 
selected grains in data RAM 208 is defined by the order in which the grains arrive at ingress 
processor 206. To obtain the correct output order, egress processor 212 obtains a ordered 
grain reading pattern from connection memory RAM 214. The order in which grains are to be 
read can be stored in any of a number of data structures. One skilledin the art will appreciate 
that any one of a number of standard listing techniques can be employed for this purpose. 
The egress processor 212 extracts the grains from the data RAMs and constructs the egress 
grain group, based on a structure and order provided by the connection memory RAM 214 
The MESS architecture achieves a reduction in the implementation area by minimizing the 
number of storage elements as well as by reducing the egress multiplexing complexity. 
[0034] Figure 1 1 illustrates a hardware circuit embodiment of the data RAMs 208 

used in one embodiment of the present invention. Two sets of data RAMs208 are employed 
to facilitate double buffering so that the ingress processor 206 can write a grain group to one 
data RAM 208 while the egress processor reads a grain group from the other data RAM 208 
with the specific memory written to, or read from, alternating at every grain group. In a 
presently preferred embodiment, each data RAM 208 can store exactly one grain group (G 
grains), which is the minimum size for a memory. The data RAMs 208may be constructed 
from a set of N independent. 1-byte wide RAMs, a single N-byte wide RAM, or combinations 
therebetween. Using a set of N RAMs is presently preferable, as it does not require buffering 
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at the ingress processor 206 and reduces power consumption on reads by a factor of N, in 
addition the use of N RAMs alls from between 0 to N different bytes to be written 
simultaneously. Because the total storage required is G bytes (assuming 1 byte grains), each 
of the N independent RAMs or the single wide RAM are G/N words deep. The writing of the 
RAMs is controlled by the ingress processor 206 and the reading of the RAMs is controlled 
by the egress processor 212. 

[0035] The ingress processor 206 writes data into data RAMs A and B 208 

sequentially starting from (RAM#1, Addr#0) to (RAM#N, Addr#0) then from (RAM #1, 
Addr#1) to (RAM#N. Addr#1) and then from (RAM #1, Addr#G/N-1) to (RAM#N. Addr#G/N- 
1). This results in a compact memory usage which avoids memory fragmentation, and results 
in complete usage of each memory location, one of which is labelled as 232. This technique 
is analogous to creating a brick wall, and as a result the memory structure is referred to as a 
"brick wair. The brick wall is restarted at the next grain group boundary in the companion 
data RAM B, 0 to N bytes can be written simultaneously in data RAM 208since only the 
selected ingress grains are stored. As an example of multiple writes to data RAM 208, Figure 
1 1 illustrates the case of 2 grains from ingress timeslot 0 being written into RAM #1 and RAM 
#2, 1 grain from ingress timeslot 3 is written into RAM #3, N-4 grains from ingress timeslot 7 
are written into RAM #4 to RAM #N-1, 4 grains from ingress timeslot 15 are written into RAM 
#N to RAM #3, N grains from ingress timeslot 23 are written into RAM #4 to RAM #3, etc. 
Egress processor 212 reads data from data RAM 208 using random access to properly order 
the outgoing grain group on the egress port. 

[0036] The ingress processor 206 selects and formats the ingress grains to be stored 

in data RAM 208. Figure 12 illustrates an embodiment of the ingress processor 206. The 
ingress processor 206 contains interest RAM 236 and compactor 234 Interest RAM 236 
contains G words of N bits. The compactor is made of N instances of N:1 multiplexers 238 
and a finite state machine (FSM) 240 to allow any of the N ingress grains to be written into 
any of the N data RAMs 208. Table 1 compares the MESS implementation of the present 
invention to the requirements of flip-flop and RAM based implementations of the prior art. 
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Where 


N = number of ports 

G = number of grains in a grain group 

Grain = 8 bits (typically) 

Flag = 1 bit 



Table 1 



[0037] The Ingress processor 206 selects and formats the ingress grains presented 
on the wide ingress data bus in order to build a compact brick wall in the data RAMs 208. 
The wide ingress data bus is obtained by merging all the ingress ports. 
[0038] The selection of grains by ingress processor 206 is performed so that grains 

meeting at least one criterion are selected. In a switch having N inputs, ingress processor 
206 receives N grains simultaneously. Each N grain block received corresponds to the grains 
received in a given timeslot. The particular timeslot that grains are received in defines the 
position of the grain in the grain group. Interest RAM 236 is preferably used by ingress 
processor 206 to store the criteria on which grains are selected. In a presently preferred 
embodiment, a grain is selected in accordance with the ingress port that it was received on, 
and the position of the grain in its grain group. Thus, interest RAM 236 can provide a 
selection mask to ingress processor 206 indicating the grains that should be selected from 
the N grain block. The interest RAM 236, thus would store G different masks, each mask 
corresponding to the grains to select in each of the G timeslots in the grain group. 
[0039] Interest RAM 236 is preferably synchronized with the wide ingress data bus, 

which can be achieved by monitoring the frame reference of the data bus. In one 
embodiment, interest RAM 236 sequentially reads each of the G words it stores after 
achieving synchronization. In this embodiment, each of the G words stored in interest RAM 
236 acts as a selection mask for the grains arriving at each time slot. After reading the final 
word in interest RAM 236 the reading is cycled back to the first word. Thus, each of the 
words is associated with a timeslot in a grain group. Each of the G words stored in interest 
RAM 236 is read at the same timeslot as the grains of that timeslot are received by 
compactor 234. In a presently prefen-ed embodiment, if the interest bit is set to one, the grain 
is selected. Otherwise, the grain is discarded. In other words, interest RAM 236 has G bits 
set to one (out of NG RAM bits) and those G bits con:^espond to the G ingress grains (out of 
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NG ingress grains) that must be selected by the egress port. Interest RAM 236 can be 
configured from the connection memory RAM 21 4 (shown in Figure 10). 
[0040] Compactor 234 does the data formatting of selected grains. The compactor 

234 receives all the ingress grains from the wide ingress data bus and the selection flags 
from the interest RAM 236. Between 0 and N grains can be selected in any timeslot. If 0 
grains are selected then the compactor does nothing. If n grains (1 < n < iV) are selected, 
compactor 234 removes the gaps between the selected grains in order to compact the data 
on the data bus. Then, the compacted data on the bus is barrel shifted to properly build the 
compact brick wall in data RAM 208. The selection of the desired grains, and the discarding 
of the other grains can be simply implemented through the use of a simple 'AND' operation. 
[0041] FSM 240 keeps track of the last location in the data RAMs 208 that contains 

valid data. At the next time-slot it controls compactor 234 to steer the first selected grain to 
the next free location and subsequently selected bytes to the adjacent locations, which will 
also be free. In a presently preferred embodiment, the location of a cell in data RAM 208is 
defined by the doublet (RAM number, RAM word address). 

[0042] The egress processor 212 serves to extract the grains from data RAMs 208 

and properly formats the egress grain group prior to transmission. Figure 13 illustrates an 
embodiment of the egress processor 212. Egress processor 212 contains egress processing 
RAM 242 and N:1 multiplexer 210. The egress processing RAM 242 contains G words of 

logj N+log2 — bits (rounded up to the nearest bit). Table 2 compares the MESS 

N 

implementation of the present invention to the requirements of flip-flop and RAM based 
implementations of the prior art. 



Block 


Flip-Flop 
Implementation 


RAM 

Implementation 


MESS 

Implementation 


Egress mux per egress port 


GN:1 


N:1 


N:1 


Egress processing RAM 
Storage per egress port 


None 


None 


G Addresses 
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Block 


Flip-Flop 

Implementation 


RAM 

Implementation 


MESS 

Implementation 


Where 


N = number of ports 

G = number of grains in a grain group 

Grain = 8 bits (typically) 

Address = logj N + log2 — typical 



Table 2 



[0043] Egress processor 212 extracts the grains from the data RAMs 208 and 

properly formats the egress grain group. Egress processing RAM 242 is preferably 
synchronized with the wide ingress data bus. Synchronization can be achieved by monitoring 
the frame reference. Once synchronized, egress processing RAM 242 sequentially reads 
each of its G words. Each word represents the address of a grain in data RAM 208 The 
address, as before, is the doublet of the RAM number (1 to N) and the address location (0 to 
G/N-1) in the indicated RAM. Egress processing RAM 242 contains an in-order list of the 
addresses corresponding to the grains to be extracted and presented on the egress port. 
Random access from data RAMs 208 allows the grains to be accessed In order to form a 
proper egress grain group. The egress processing RAM 242 can be configured from the 
connection memory RAM 214 (shown in Figure 10). 

[0044] The present invention provides a reduced implementation area for a switch by 

reducing the memory footprint and complex multiplexing circuitry. This reduction requires the 
implementation of additional logic such as ingress processor 206 and egress processor 212. 
However, the area occupied by these logic blocks is negligible when the grain count G is 
large. The area savings attributable to memory and multiplexing reduction and simplification 
more than offsets the area occupied by the additional logic. To demonstrate the area 
savings, the following sets of equations show the area required for a SONET/SDH 
application (a 45 G VT/TU wideband cross connect where the grain number G equals 1344 
and the port number N equals 18). 

[0045] Equations 6.1 through 6.4 define the required area for a flip-flop based 

implementation of an N x N memory switch (or a wideband cross connect). With a flip-flop 
based implementation, the wideband cross connect requires 54 600 000 gates. 
Area ff = N {1 0 x 2 x [Storage] + [Muxing] } (6.1) 
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Area ff = N {1 0 x 2 x [8 bit Grains] + [8 bit Egress Muxing] } (6.2) 
Area ff = N {10 x 2 x [8G] + [8NG log2(NG)] } (6.3) 
Where N = 18 

G = 1344 

Area ff « 1 8 {21 5 040 + 2 81 8 31 8 }« 54 600 000 gates (6.4) 
[0046] The factor 10 before the storage element accounts for the 10 gates required 

per flip-flop, while the factor 2 before the storage element accounts for the double buffering. 
[0047] Equations 7.1 through 7.4 define the required area for a RAM based 

implementation of an N x N memory switch (or a wideband cross connect). With a RAM 
based implementation, the wideband cross connect requires 20 900 000 gates. 



Area ram = N {3 X 2 X [Storage] + [Muxing] } (7.1) 

Area ram = N {3 x 2 x [8 bit Grains] + [8 bit Egress l\iuxing] } (7.2) 

Area ram = N {3 x 2 x [8NG] + [8N logz (N)] } (7.3) 
Where N=18 

G = 1344 

Area ram « 18 {1 161 216 + 600 }« 20 900 000 gates (7.4) 



[0048] The factor 3 before tlie storage element accounts for the 3 gates required per 

RAIVI bit, while the factor 2 before the storage element accounts for the double buffering. 
[0049] Equations 8.1 through 8.4 define the required area for a MESS based 

implementation of an N x N memory switch (or a wideband cross connect). With a MESS 
based implementation, the wideband cross connect requires 5 500 000 gates. 
Area MESS = N {3 X 2 x [Storage] + [Muxing] } (8.1) 
Area mess = N {3 x 2 x [8 bit Grains + interest + Egress] + [8 bit compactor + 8 bit (8.2) 
Egress Muxing]} 

Area mess = N {3 x 2 x [8G + NG + G log2(N) + G logz (G/N)] + [8N2 Loga (N) + 8N (8.3) 

log2(N)]} 
Where N=18 

G = 1344 

Area mess « 1 8 {293 467 + 1 1 408} « 5 500 000 gates (8.4) 
[0050] The factor 3 before the storage element accounts for the 3 gates required per 
RAM bit, while the factor 2 before the storage element accounts for the double buffering. 
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[0051] Using the above calculated values, the area saving in relation to the prior art is 

at least a 73% area reduction in comparison to either the flip-flop or the RAM based 
implementation when the grain count G is large. 

[0052] The embodiment of the invention as described above utilizes a connection 
memory RAM 214 to allow an operator to configure the interest RAM 236 and the egress 
processing RAM 242. One skilled in the art will appreciate that the connection memory RAM 
214 is merely preferably employed in the system of the present invention. The connection 
memory RAM 214 provides a convenient tool to allow the user to program the memory 
switch by defining the egress doublet (egress grain, ingress grain) for each egress grain. 
Nevertheless, the connection memory RAM 214 can easily be removed and both the interest 
RAM 236 and the egress processing RAM 242 can be configured directly via a 
microprocessor interface. 

[0053] One skilled in the art will also appreciate that the MESS egress port 204can 

be implemented with an N:M multiplexer in place of N:1 multiplexer 210. This allows for a 
single MESS egress port 204 to replace a plurality of egress ports, which avoids duplicating 
control logic such as frame counters and synchronizing circuitry. Such an implementation 
requires the interest RAM 236 and the egress processing RAM 242 to be accordingly 
adjusted. The interest RAM 236 should store a mask that selects all grains destined for one 
of the plurality of egress ports served by the MESS egress port 204. In this embodiment, the 
egress processing RAM 242 stores M-plets of data for selecting the addresses of the M 
grains required at a single time. This implementation preferably utilizes a multiple read data 
RAM 208 that allows M addresses to be read simultaneously. This can be implemented 
through a number of known memory configurations. 

[0054] The above described system can be used in conjunction with any system 

where data must be switched from ingress sources to egress destinations. One skilled in the 
art will appreciate that it Is preferably used in systems where the number of grains in a grain 
group is sufficiently large so as to provide an area savings. However, if area consumption is 
not a concern, it can be used in very small grain group scenarios. 

[0055] The above-described embodiments of the present invention are Intended to be 

examples only. Alterations, modifications and variations may be effected to the particular 
embodiments by those of skill in the art without departing from the scope of the invention, 
which is defined solely by the claims appended hereto. 
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